A global assembly of cotton ESTs.

نویسندگان

  • Joshua A Udall
  • Jordan M Swanson
  • Karl Haller
  • Ryan A Rapp
  • Michael E Sparks
  • Jamie Hatfield
  • Yeisoo Yu
  • Yingru Wu
  • Caitriona Dowd
  • Aladdin B Arpat
  • Brad A Sickler
  • Thea A Wilkins
  • Jin Ying Guo
  • Xiao Ya Chen
  • Jodi Scheffler
  • Earl Taliercio
  • Ricky Turley
  • Helen McFadden
  • Paxton Payton
  • Natalya Klueva
  • Randell Allen
  • Deshui Zhang
  • Candace Haigler
  • Curtis Wilkerson
  • Jinfeng Suo
  • Stefan R Schulze
  • Margaret L Pierce
  • Margaret Essenberg
  • Hyeran Kim
  • Danny J Llewellyn
  • Elizabeth S Dennis
  • David Kudrna
  • Rod Wing
  • Andrew H Paterson
  • Cari Soderlund
  • Jonathan F Wendel
چکیده

Approximately 185,000 Gossypium EST sequences comprising >94,800,000 nucleotides were amassed from 30 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including drought stress and pathogen challenges. These libraries were derived from allopolyploid cotton (Gossypium hirsutum; A(T) and D(T) genomes) as well as its two diploid progenitors, Gossypium arboreum (A genome) and Gossypium raimondii (D genome). ESTs were assembled using the Program for Assembling and Viewing ESTs (PAVE), resulting in 22,030 contigs and 29,077 singletons (51,107 unigenes). Further comparisons among the singletons and contigs led to recognition of 33,665 exemplar sequences that represent a nonredundant set of putative Gossypium genes containing partial or full-length coding regions and usually one or two UTRs. The assembly, along with their UniProt BLASTX hits, GO annotation, and Pfam analysis results, are freely accessible as a public resource for cotton genomics. Because ESTs from diploid and allotetraploid Gossypium were combined in a single assembly, we were in many cases able to bioinformatically distinguish duplicated genes in allotetraploid cotton and assign them to either the A or D genome. The assembly and associated information provide a framework for future investigation of cotton functional and evolutionary genomics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of ESTs from multiple Gossypium hirsutum tissues and identification of SSRs.

In an effort to expand the Gossypium hirsutum L. (cotton) expressed sequence tag (EST) database, ESTs representing a variety of tissues and treatments were sequenced. Assembly of these sequences with ESTs already in the EST database (dbEST, GenBank) identified 9675 cotton sequences not present in GenBank. Statistical analysis of a subset of these ESTs identified genes likely differentially expr...

متن کامل

Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database

A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton E...

متن کامل

Generation and Analysis of a Large-Scale Expressed Sequence Tag Database from a Full-Length Enriched cDNA Library of Developing Leaves of Gossypium hirsutum L

BACKGROUND Cotton (Gossypium hirsutum L.) is one of the world's most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. METHODOLOGY/PRINCIPAL FINDINGS In this study, 9,874 high-quality ESTs were generated from a normalized, full-le...

متن کامل

Accumulation of genome-specific transcripts, transcription factors and phytohormonal regulators during early stages of fiber cell development in allotetraploid cotton.

Gene expression during the early stages of fiber cell development and in allopolyploid crops is poorly understood. Here we report computational and expression analyses of 32 789 high-quality ESTs derived from Gossypium hirsutum L. Texas Marker-1 (TM-1) immature ovules (GH_TMO). The ESTs were assembled into 8540 unique sequences including 4036 tentative consensus sequences (TCs) and 4504 singlet...

متن کامل

Generation, Annotation and Analysis of First Large-Scale Expressed Sequence Tags from Developing Fiber of Gossypium barbadense L

BACKGROUND Cotton fiber is the world's leading natural fiber used in the manufacture of textiles. Gossypium is also the model plant in the study of polyploidization, evolution, cell elongation, cell wall development, and cellulose biosynthesis. G. barbadense L. is an ideal candidate for providing new genetic variations useful to improve fiber quality for its superior properties. However, little...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genome research

دوره 16 3  شماره 

صفحات  -

تاریخ انتشار 2006